Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events

Authors

  • Matthieu Chartier
  • Francis Gaudreault
  • Rafael Najmanovich
Abstract

MOTIVATION An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most studies of rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and show how it can be used for large-scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database, which covers over 70% of the known protein sequence universe. Our program, Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output.

RESULTS Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins that are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families.

AVAILABILITY The Sherlocc program and search interface are open access and are available at http://bcb.med.usherbrooke.ca
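The idea of scanning a coding sequence for stretches of rarely used codons can be illustrated with a minimal sketch. This is not Sherlocc's actual algorithm, which the abstract does not describe in detail and which works on Pfam family alignments. The function names, the window size, and the frequency threshold below are all hypothetical choices made for illustration, and the sketch handles a single sequence with a simple sliding-window mean and no statistical significance test.

```python
from collections import Counter


def split_codons(cds):
    """Split a coding sequence into complete codons (a trailing partial codon is dropped)."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def codon_usage(sequences):
    """Fraction of all observed codons that each codon accounts for in a reference set."""
    counts = Counter(c for s in sequences for c in split_codons(s))
    total = sum(counts.values())
    return {codon: n / total for codon, n in counts.items()}


def rare_codon_windows(cds, usage, window=7, threshold=0.01):
    """Return (start_codon_index, mean_usage) for each window whose mean usage is below threshold.

    window and threshold are illustrative assumptions, not values taken from the paper.
    Codons absent from the usage table are treated as having frequency 0.
    """
    codons = split_codons(cds)
    hits = []
    for start in range(len(codons) - window + 1):
        mean = sum(usage.get(c, 0.0) for c in codons[start:start + window]) / window
        if mean < threshold:
            hits.append((start, mean))
    return hits
```

In use, `codon_usage` would be computed from a reference set of coding sequences for the organism, and `rare_codon_windows` would then be applied to each sequence of interest. Detecting clusters that are conserved across a family, as the abstract describes, would additionally require comparing window positions across aligned sequences.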

Similar resources

Widespread position-specific conservation of synonymous rare codons within coding sequences

Synonymous rare codons are considered to be sub-optimal for gene expression because they are translated more slowly than common codons. Yet surprisingly, many protein coding sequences include large clusters of synonymous rare codons. Rare codons at the 5' terminus of coding sequences have been shown to increase translational efficiency. Although a general functional role for synonymous rare cod...

Modeling translation elongation dynamics by deep learning reveals new insights into the landscape of ribosome stalling

Translation elongation plays a central role in multiple aspects of protein biogenesis, e.g., differential expression, cotranslational folding and secretion. However, our current understanding on the regulatory mechanisms underlying translation elongation dynamics and the functional roles of ribosome stalling in protein synthesis still remains largely limited. Here, we present a deep learning-ba...

The Characteristics of Rare Codon Clusters in the Genome and Proteins of Hepatitis C Virus; a Bioinformatics Look

BACKGROUND Recent studies suggest that rare codon clusters are functionally important for protein activity. METHODS Here, for the first time we analyzed and reported rare codon clusters in Hepatitis C Virus (HCV) genome and then identified the location of these rare codon clusters in the structure of HCV protein. This analysis was performed using the Sherlocc program that detects statistically ...

Structural Basis for Translation Termination on a Pseudouridylated Stop Codon.

Pseudouridylation of messenger RNA emerges as an abundant modification involved in gene expression regulation. Pseudouridylation of stop codons in eukaryotic and bacterial cells results in stop-codon read through. The structural mechanism of this phenomenon is not known. Here we present a 3.1-Å crystal structure of Escherichia coli release factor 1 (RF1) bound to the 70S ribosome in response to...

Construction and Eukaryotic Expression of Recombinant Large Hepatitis Delta Antigen

Background: Hepatitis delta virus (HDV) is a subviral human pathogen that exploits host RNA editing activity to produce two essential forms of the sole viral protein, hepatitis delta antigen (HDAg). Editing at the amber/W site of HDV antigenomic RNA leads to the production of the large form (L-HDAg), which is required for RNA packaging. Methods: In this study, PCR-based site-directed mutagen...

Journal:
  • Bioinformatics

Volume 28, Issue 11

Pages: -

Publication date: 2012